SYMMETRIC AND CENTERED BINOMIAL APPROXIMATION 
OF SUMS OF LOCALLY DEPENDENT RANDOM VARIABLES 



By Adrian Röllin

Stein's method is used to approximate sums of discrete and locally
dependent random variables by a centered and symmetric Binomial distribution.
Under appropriate smoothness properties of the summands, the
same order of accuracy as in the Berry-Esseen Theorem is achieved. The
approximation of the total number of points of a point process is also
considered. The results are applied to the exceedances of the r-scans process
and to the Matérn hard-core point process type I.

1. Introduction. The approximation of sums of dependent random variables 
by the standard normal distribution has been investigated in a large variety of 
settings. The accuracy of approximation is most often measured by the Kolmogorov 
and Wasserstein metrics. The use of stronger metrics typically requires that some
'smoothness' condition be satisfied.

In this paper, under the assumption of a general local dependence structure, 
we study the approximation of sums of discrete random variables by a symmetric 
and centered Binomial distribution. This distribution serves as a replacement for the
normal distribution in a discrete setting. Under some general smoothness property
of the summands, the same order of accuracy as in the Berry-Esseen Theorem can be
achieved, but now for the total variation metric. We also examine another metric,
from which local limit approximations can be obtained. 

In the setting of independent summands, approximation by a centered Poisson
distribution has been successfully adopted by Čekanavičius and Vaitkus and by
Barbour and Čekanavičius. However, for dependent summands, applications were
limited to simple examples; first attempts were made by Barbour and Xia and by
Čekanavičius and Vaitkus. In contrast, the results in this paper are of a general
nature and allow a wide range of applications.

The proofs are based on Stein's method for distributional approximation. A main
idea, introduced by Röllin in earlier work, is to use interpolation functions to represent the Stein
operator of a discrete distribution as the Stein operator of a continuous distribution.
In the case of the Binomial, this then allows the application of standard techniques 
in Stein's method for normal approximation. A careful analysis of the remainder 
terms then shows how a suitable smoothness condition can be exploited, to obtain 
total variation error bounds. 

The paper is organized as follows. In the next section, we introduce the main 
technique in the simple case of independent summands. In section 3 these results 
are extended to locally dependent summands and section 4 shows their application 
in some examples. Section 5 contains some technical lemmas. 

1.1. Notation. Denote by $\mathrm{Bi}(n,p)$ the Binomial distribution with $n$ trials of
probability $p$ each. Denote by $\widehat{\mathrm{Bi}}(n,p)$ the centered Binomial distribution, i.e. a
Binomial distribution shifted by $-np$. Note that this distribution does not necessarily
lie on the integers, but on a lattice of $\mathbb{R}$ with span 1.

Throughout the paper, we shall be concerned with two metrics for probability
distributions, the total variation metric $d_{\mathrm{TV}}$ and the local limit metric $d_{\mathrm{loc}}$, where,
for two probability distributions $P$ and $Q$,

$d_{\mathrm{TV}}(P,Q) := \sup_{A \subset \mathbb{R}} |P(A) - Q(A)|$,

$d_{\mathrm{loc}}(P,Q) := \sup_{x \in \mathbb{R}} |P([x,x+1)) - Q([x,x+1))|$.

For simplicity, we will often use the notation $d_l$, where $l = 1$ will stand for $d_{\mathrm{TV}}$ and
$l = 2$ for $d_{\mathrm{loc}}$.

We denote by $\|\cdot\|$ the supremum norm if applied to functions, and the variation
norm if applied to measures. Let $\delta_x$ denote the unit mass at $x \in \mathbb{R}$, and $*$ the
convolution of measures. Define for any measure $\mu$ and any $l \in \mathbb{N} := \{1, 2, \dots\}$

$D^l(\mu) = \|\mu * (\delta_1 - \delta_0)^{*l}\|$.

Note that for measures $\mu$ and $\lambda$,

(1.1) $D^1(\mu) = 2 d_{\mathrm{TV}}(\mu, \mu * \delta_1)$,

(1.2) $D^2(\mu * \lambda) \le D^1(\mu) D^1(\lambda)$.

Furthermore, define $\langle x \rangle := x - \lfloor x \rfloor$ to be the fractional part of $x \in \mathbb{R}$, and
$(x)_+ = x \vee 0$.
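
As a small numerical illustration of these definitions, the following Python sketch computes $D^l(\mu)$ for a finitely supported measure on the integers and checks (1.1) and (1.2) on one example. The dictionary representation of a measure and the helper names (`convolve`, `D`, `d_tv`) are assumptions made for this illustration and are not part of the paper.

```python
def convolve(mu, lam):
    """Convolution of two finitely supported signed measures on the integers.

    A measure is represented as a dict mapping an integer point to its mass.
    """
    out = {}
    for x, mx in mu.items():
        for y, my in lam.items():
            out[x + y] = out.get(x + y, 0.0) + mx * my
    return out


def variation_norm(mu):
    """Variation norm: the sum of the absolute values of all masses."""
    return sum(abs(m) for m in mu.values())


def D(mu, l):
    """D^l(mu) = || mu * (delta_1 - delta_0)^{*l} ||."""
    for _ in range(l):
        mu = convolve(mu, {1: 1.0, 0: -1.0})
    return variation_norm(mu)


def d_tv(mu, lam):
    """Total variation distance between two probability measures."""
    points = set(mu) | set(lam)
    return 0.5 * sum(abs(mu.get(k, 0.0) - lam.get(k, 0.0)) for k in points)


mu = {0: 0.2, 1: 0.5, 2: 0.3}
lam = {0: 0.3, 1: 0.4, 2: 0.3}
mu_shifted = convolve(mu, {1: 1.0})                  # mu * delta_1

lhs_11, rhs_11 = D(mu, 1), 2 * d_tv(mu, mu_shifted)  # both sides of (1.1)
lhs_12 = D(convolve(mu, lam), 2)                     # left side of (1.2)
rhs_12 = D(mu, 1) * D(lam, 1)                        # right side of (1.2)
```

For unimodal distributions on the integers, $D^1$ is twice the largest point mass, which is why $D^1$ of a sum of many independent non-degenerate summands becomes small.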

1.2. Basic setup. Consider a sum of the form $W = \sum_{i \in J} \xi_i$, where $W$ takes its
values in a lattice of $\mathbb{R}$ with span 1. The expectation of $W$ has no influence on the
quality of the approximation, and we therefore assume without loss of generality
that $\mathbb{E}W = 0$; this can always be accomplished by subtracting the expectation
from each individual summand. Each of the summands may now take its values on
a different lattice; this, however, will result in no further complications.

To approximate W by a centered binomial distribution, we have to choose n 
in such a way that the variance of Bi(n, 1/2) is as close to the variance of W 
as possible. As n has to be an integer, this is only possible up to a rounding error.
However, the symmetric and centered Binomial distribution thus chosen will in 
general take its values on a different lattice from W and the total variation distance 
will become 1. To circumvent this problem, we introduce an additional parameter 
t and approximate W by a centered Binomial distribution with success probability 
1/2 — t instead (t being small), to be able to match not only the variance but also 
the lattice. 

Hence, to put the above in a rigorous form, we will make the following assump- 
tions if not otherwise stated: 

Assumptions G: Let $J$ be a finite set and let $\{\xi_i, i \in J\}$ be a collection of random
variables with $\mathbb{E}\xi_i = 0$ for all $i \in J$, and assume that there are numbers $\{a_i \in \mathbb{R}; i \in J\}$
such that almost surely $\xi_i \in \mathbb{Z} + a_i$. Let $W = \sum_{i \in J} \xi_i$; then $\mathbb{E}W = 0$ and almost
surely $W \in \mathbb{Z} + a$ for $a := \sum_{i \in J} a_i$. Assume that $\sigma^2 := \operatorname{Var} W > 1$. Define now
$\delta := \langle -4\sigma^2 \rangle$ and $t := \langle a + 2\sigma^2 + \delta/2 \rangle / (4\sigma^2 + \delta)$. Clearly, $4\sigma^2 + \delta = \lceil 4\sigma^2 \rceil$, and by
definition the distribution $\widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)$ has expectation 0; it is also easy to
check that it takes values in $\mathbb{Z} + a$.
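
The bookkeeping in Assumptions G can be traced in a few lines of Python. The sketch below reads the definitions as $\delta = \langle -4\sigma^2 \rangle$ and $t = \langle a + 2\sigma^2 + \delta/2 \rangle / (4\sigma^2 + \delta)$ and uses exact rational arithmetic; the function names and the numerical example are assumptions chosen for illustration only.

```python
from fractions import Fraction
from math import floor


def frac_part(x):
    """Fractional part <x> = x - floor(x)."""
    return x - floor(x)


def binomial_parameters(sigma2, a):
    """Return (n, delta, t) as read from Assumptions G, in exact arithmetic."""
    sigma2, a = Fraction(sigma2), Fraction(a)
    delta = frac_part(-4 * sigma2)
    n = 4 * sigma2 + delta                      # equals ceil(4 * sigma2)
    t = frac_part(a + 2 * sigma2 + delta / 2) / n
    return int(n), delta, t


# Hypothetical example: W = S - 12.3 with S ~ Bi(41, 0.3),
# so that a = -12.3 and sigma^2 = 41 * 0.3 * 0.7 = 8.61.
n, delta, t = binomial_parameters("8.61", "-12.3")
p = Fraction(1, 2) - t
# The centered binomial lives on Z - n p; this offset is 0 iff Z - n p = Z + a.
lattice_offset = frac_part(-n * p - Fraction("-12.3"))
```

In this example $n = 35$, $\delta = 14/25$ and $t = 1/175$, so the success probability is moved only slightly away from $1/2$ in order to match the lattice of $W$.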

From the above definition, we see that $t$ is only of order $O(\sigma^{-2})$, which is rather
small in the setting that we are concerned with; Corollary 2.3 shows how to obtain
results without $t$, using Lemma 5.2.

2. Sum of Independent Random Variables. First, we examine the case of
independent discrete summands. Previous work on total variation approximation
has been concerned with the compound Poisson distribution (see Le Cam and
Roos and references therein), the signed compound Poisson distribution (see
Čekanavičius and references therein), the Poisson distribution (see Barbour et al.),
the centered Poisson distribution (see Čekanavičius, Čekanavičius and
Vaitkus, Barbour and Xia, and Barbour and Čekanavičius) and some
more general distributions (see Brown and Xia).

We present the theorem below to demonstrate the main technique in a simple
setting, noting that it also follows as a consequence of Theorem 3.1.

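Theorem 2.1. Let $\{\xi_i, i \in J\}$ be independent and satisfy Assumptions G. Then,
if the $\xi_i$ have finite third moments,

$d_l(\mathscr{L}(W), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le \sigma^{-2}\Big(\sum_{i \in J} c_{l,i}\rho_i + 1.75\Big)$, $l = 1,2$,

where $\rho_i = \sigma_i^3 + \frac12 \mathbb{E}|\xi_i|^3$, $\sigma_i^2 = \operatorname{Var}\xi_i$ and $c_{l,i} = D^l(\mathscr{L}(W - \xi_i))$.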

It is clear that the above bound is useful only if the $c_{l,i}$ are small. In the case of $n$
identically distributed random variables, we need $c_{1,i} = o(1)$ as $n \to \infty$ for asymptotic
approximation in total variation, and in order to deduce a local limit theorem
we must have $c_{2,i} = o(n^{-1/2})$. This is however always the case if $D^1(X_1) < 2$ (this
corresponds to the usual condition in the LLT that $X_1$ must not be concentrated
on a lattice with span greater than 1), as can be seen from (5.9) and (5.10), and we
then even have $c_{l,i} = O(n^{-l/2})$ for $l = 1,2$.
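
To get a feeling for the size of the bound, the following sketch compares it with the exact total variation distance for a centered sum of i.i.d. Bernoulli variables. It assumes the readings $\rho_i = \sigma_i^3 + \frac12 \mathbb{E}|\xi_i|^3$, $c_{1,i} = D^1(\mathscr{L}(W - \xi_i))$ and $t = \langle a + n/2 \rangle / n$ with $n = \lceil 4\sigma^2 \rceil$; the function names and the chosen parameters are illustrative assumptions, not taken from the paper.

```python
from fractions import Fraction
from math import ceil, comb, floor


def binom_pmf(n, p, k):
    """Probability that a Bi(n, p) random variable equals k (0 outside 0..n)."""
    if k < 0 or k > n:
        return 0.0
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def D1_binomial(n, p):
    """D^1 of Bi(n, p): sum over k of |pmf(k) - pmf(k - 1)|."""
    return sum(abs(binom_pmf(n, p, k) - binom_pmf(n, p, k - 1)) for k in range(n + 2))


def check_theorem_2_1(m, p):
    """Exact d_TV and the bound of Theorem 2.1 for W = S - m p, S ~ Bi(m, p).

    p must be a Fraction so that the lattice bookkeeping is exact.
    """
    q = 1 - p
    sigma2 = m * p * q                       # Var W, assumed to exceed 1
    a = -m * p                               # W takes values in Z + a
    n = ceil(4 * sigma2)
    shift = floor(a + Fraction(n, 2))
    t = (a + Fraction(n, 2) - shift) / n     # t = <a + n/2> / n
    p_hat = float(Fraction(1, 2) - t)

    # W = j + a corresponds to the centered binomial taking the value
    # k - n (1/2 - t) with k = j + shift.
    lo, hi = min(0, -shift), max(m, n - shift)
    pf = float(p)
    tv = 0.5 * sum(
        abs(binom_pmf(m, pf, j) - binom_pmf(n, p_hat, j + shift))
        for j in range(lo, hi + 1)
    )

    qf = 1 - pf
    rho = (pf * qf) ** 1.5 + 0.5 * (pf * qf**3 + qf * pf**3)
    c1 = D1_binomial(m - 1, pf)              # c_{1,i} = D^1(L(W - xi_i))
    bound = (m * c1 * rho + 1.75) / float(sigma2)
    return n, float(t), tv, bound


n, t, tv, bound = check_theorem_2_1(41, Fraction(3, 10))
```

The constant 1.75 makes the bound conservative for moderate $\sigma^2$; the point of the theorem is the order of the bound, not its numerical sharpness.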

Before proving the theorem, we start with a short summary of Stein's method
for Binomial approximation; for details see also Stein and Ehm. Denote by
$F(M)$ the set of all real valued measurable functions on some given measure space
$M$. A Stein operator $B : F(\mathbb{Z}) \to F(\mathbb{Z})$ for the Binomial distribution $\mathrm{Bi}(n,p)$ is
characterized by the fact that, for any integer valued random variable $W$,

(2.1) $\mathbb{E}(Bg)(W) = 0$ for all bounded $g \in F(\mathbb{Z})$ $\iff$ $W \sim \mathrm{Bi}(n,p)$,

and a possible choice is

(2.2) $(Bg)(z) = qz\,g(z-1) - p(n-z)\,g(z)$, for all $z \in \mathbb{Z}$,

where, as usual, we put $q = 1 - p$.
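
The characterizing property (2.1) is easy to check numerically for the operator (2.2). The sketch below evaluates $\mathbb{E}(Bg)(Y)$ exactly over the binomial probabilities; the function names and the test function are assumptions made for this illustration.

```python
from math import comb, sin


def stein_operator(g, n, p, z):
    """(Bg)(z) = q z g(z - 1) - p (n - z) g(z), as in (2.2)."""
    q = 1 - p
    return q * z * g(z - 1) - p * (n - z) * g(z)


def expected_stein(g, n, p_operator, p_law):
    """E (Bg)(Y) for Y ~ Bi(n, p_law), with B the operator for Bi(n, p_operator)."""
    return sum(
        comb(n, z) * p_law**z * (1 - p_law) ** (n - z)
        * stein_operator(g, n, p_operator, z)
        for z in range(n + 1)
    )


g = lambda z: sin(z) + 0.1 * z * z            # an arbitrary test function
matched = expected_stein(g, 12, 0.3, 0.3)     # vanishes up to rounding error
# With g = 1 the expectation reduces to E Y - n p = n (p_law - p_operator).
mismatched = expected_stein(lambda z: 1.0, 12, 0.3, 0.5)
```

When the law and the operator use different success probabilities, the expectation no longer vanishes; for the constant function it equals the difference of the means, here $12 \cdot (0.5 - 0.3) = 2.4$.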

Let $h \in F(\mathbb{Z})$ be a bounded function. Then the solution $g = g_h$ to the Stein
equation

(2.3) $(Bg)(z) = I[0 \le z \le n]\{h(z) - \mathbb{E}h(Y)\}$, for all $z \in \mathbb{Z}$,

where $Y \sim \mathrm{Bi}(n,p)$, is also bounded. If the functions $h$ are of the form $h(z) =
h_A(z) = I[z \in A]$, $A \subset \mathbb{Z}$, we have the uniform bound

(2.4) $\|\Delta g_A\| \le \frac{1}{(n+1)pq}$,

where $\Delta g(z) := g(z+1) - g(z)$, and the same bound holds for $\|g_{\{b\}}\|$, $b \in \mathbb{Z}$; see
Ehm. Now, for all $z \in \mathbb{Z}$, we can write

$I[z \in A] - \mathbb{P}[Y \in A] = (Bg_A)(z) + I[z \notin \{0, \dots, n\}](I[z \in A] - \mathbb{P}[Y \in A])$,

and thus, for any integer valued random variable $V$,

(2.5) $d_{\mathrm{TV}}(\mathscr{L}(V), \mathrm{Bi}(n,p)) = \sup_{A \subset \mathbb{Z}} |\mathbb{P}[V \in A] - \mathbb{P}[Y \in A]| \le \sup_{A \subset \mathbb{Z}} |\mathbb{E}(Bg_A)(V)| + \mathbb{P}[|V - n/2| > n/2]$.

We now construct a Stein operator for the centered Binomial distribution $\widehat{\mathrm{Bi}}(n,p)$
on the lattice $\mathbb{Z} - np$. For any function $g \in F(\mathbb{Z})$ define the function $\hat g \in F(\mathbb{Z} - np)$
by $\hat g(w) := g(w + np)$ for $w \in \mathbb{Z} - np$. Then the Stein operator is defined as

(2.6) $(\hat B \hat g)(w) := (Bg)(w + np)$
$= p(w + np)g(w + np) + q(w + np)g(w - 1 + np) - np\,g(w + np)$
$= w(p\hat g(w) + q\hat g(w - 1)) - npq\,\Delta\hat g(w - 1)$,

for all $w \in \mathbb{Z} - np$. Thus, for $W = V - np$, an inequality corresponding to (2.5)
holds, namely

(2.7) $d_{\mathrm{TV}}(\mathscr{L}(W), \widehat{\mathrm{Bi}}(n,p)) \le \sup_{B \subset \mathbb{Z} - np} |\mathbb{E}(\hat B \hat g_B)(W)| + \mathbb{P}[|W + n(p - 1/2)| > n/2]$.

An equivalent inequality holds for the $d_{\mathrm{loc}}$ metric, but the supremum is taken only
over the sets $\{b\}$, $b \in \mathbb{Z} - np$.

Under the assumptions of the theorem, $n = \lceil 4\sigma^2 \rceil = 4\sigma^2 + \delta$ and $p = 1/2 - t$,
and (2.6) becomes

(2.8) $(\hat B g)(w) = w\Theta g(w - 1) - \sigma^2 \Delta g(w - 1) + (t^2(4\sigma^2 + \delta) - wt - \delta/4)\Delta g(w - 1)$,

where $\Theta g(w) := \frac12 (g(w + 1) + g(w))$. Since $\sigma^2 > 1$, the bound (2.4) simplifies to

(2.9) $\|\Delta g_B\| \le \frac{1}{\sigma^2}$.

To see this, note that $t < 1/\lceil 4\sigma^2 \rceil = 1/n$ and $n = \lceil 4\sigma^2 \rceil \ge 5$. Then from (2.4) we
have

$\|\Delta g_B\| \le \frac{1}{(n+1)pq} = \frac{1}{(n+1)(1/4 - t^2)} \le \frac{4n^2}{(n+1)(n^2 - 4)} \le \frac{4}{n} \le \frac{1}{\sigma^2}$.
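
The passage from the last line of (2.6) to the form (2.8) is pure algebra, using $p = 1/2 - t$, $q = 1/2 + t$ and $npq = \sigma^2 + \delta/4 - nt^2$. The following sketch checks the identity in exact rational arithmetic at a few lattice points; the function name, the test function and the parameter values are assumptions for illustration.

```python
from fractions import Fraction
from math import floor


def check_2_8(sigma2, a, g, w):
    """Return both sides of (2.8) at a point w, in exact rational arithmetic."""
    delta = -4 * sigma2 - floor(-4 * sigma2)             # <-4 sigma^2>
    n = 4 * sigma2 + delta
    x = a + 2 * sigma2 + delta / 2
    t = (x - floor(x)) / n
    p, q = Fraction(1, 2) - t, Fraction(1, 2) + t

    dg = g(w) - g(w - 1)                                 # Delta g(w - 1)
    theta = (g(w) + g(w - 1)) / 2                        # Theta g(w - 1)
    lhs = w * (p * g(w) + q * g(w - 1)) - n * p * q * dg  # last line of (2.6)
    rhs = w * theta - sigma2 * dg + (t * t * n - w * t - delta / 4) * dg
    return lhs, rhs


sigma2, a = Fraction(861, 100), Fraction(-123, 10)
g = lambda w: w**3 - 2 * w                               # arbitrary test function
results = [check_2_8(sigma2, a, g, a + k) for k in range(-3, 4)]
```

The first two terms on the right of (2.8) form the operator treated in Lemma 2.2, and the last term is the perturbation that is bounded separately in the proof of Theorem 2.1.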



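Lemma 2.2. Assume the conditions of Theorem 2.1. Define $\mathcal{A} : F(\mathbb{Z} + a) \to
F(\mathbb{Z} + a)$ by

$(\mathcal{A}g)(w) := w\Theta g(w - 1) - \sigma^2 \Delta g(w - 1)$, $w \in \mathbb{Z} + a$, $g \in F(\mathbb{Z} + a)$.

Then,

(2.10) $|\mathbb{E}(\mathcal{A}g)(W)| \le \Big(\|\Delta g\| \sum_{i \in J} c_{1,i}\rho_i\Big) \wedge \Big(\|g\| \sum_{i \in J} c_{2,i}\rho_i\Big)$.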

Proof. For every $w \in \mathbb{Z} + a$ and $x \in [0, 1)$ define

(2.11) $f(w + x) := \Theta g(w - 1) + x\Delta g(w - 1) + \frac12 x^2 \Delta^2 g(w - 1)$.

One easily checks that $f \in C^1$ and $f(w) = \Theta g(w - 1)$ and $f'(w) = \Delta g(w - 1)$,
hence

(2.12) $(\mathcal{A}g)(w) = w f(w) - \sigma^2 f'(w)$,

for all $w \in \mathbb{Z} + a$. Furthermore, $f'$ is absolutely continuous, hence $f''$ exists almost
everywhere. Choose $f''$ to be the function

(2.13) $f''(w + x) = \Delta^2 g(w - 1)$

for all $w \in \mathbb{Z} + a$, $0 \le x < 1$.
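
The claim that the piecewise quadratic interpolation (2.11) is continuously differentiable can be checked by comparing the left limit at $w + 1$ with the value at $w + 1$, both for $f$ and for $f'$. The sketch below does this exactly for an integer lattice (the case $a = 0$, chosen here only for simplicity) and also verifies (2.12); the function names and the test function are assumptions.

```python
from fractions import Fraction


def make_f(g, sigma2):
    """Interpolation (2.11) for a function g on the integers (the case a = 0)."""

    def theta(w):        # Theta g(w) = (g(w + 1) + g(w)) / 2
        return Fraction(g(w + 1) + g(w), 2)

    def delta(w):        # Delta g(w) = g(w + 1) - g(w)
        return g(w + 1) - g(w)

    def delta2(w):       # second difference Delta^2 g(w)
        return delta(w + 1) - delta(w)

    def f(w, x):         # f(w + x) for integer w and 0 <= x <= 1
        return theta(w - 1) + x * delta(w - 1) + Fraction(1, 2) * x * x * delta2(w - 1)

    def f_prime(w, x):   # derivative of f at w + x
        return delta(w - 1) + x * delta2(w - 1)

    def A(w):            # (A g)(w) = w Theta g(w - 1) - sigma^2 Delta g(w - 1)
        return w * theta(w - 1) - sigma2 * delta(w - 1)

    return f, f_prime, A


g = lambda w: w**3 - 4 * w * w + 1            # arbitrary integer-valued test function
sigma2 = Fraction(5, 2)
f, f_prime, A = make_f(g, sigma2)

ws = range(-3, 4)
value_glue = all(f(w, 1) == f(w + 1, 0) for w in ws)              # f is continuous
slope_glue = all(f_prime(w, 1) == f_prime(w + 1, 0) for w in ws)  # f' is continuous
identity_2_12 = all(A(w) == w * f(w, 0) - sigma2 * f_prime(w, 0) for w in ws)
```

The second derivative of this interpolation is the piecewise constant function in (2.13), which is what links the Taylor remainder to second differences of $g$.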

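We can now apply the usual Taylor expansion (cf. Reinert, Theorem 2.1),
but with a refined estimate of the remainder terms. Write $W_i = W - \xi_i$, $i \in J$; then

$\xi_i f(W) = \xi_i f(W_i) + \xi_i^2 f'(W_i) + \xi_i^3 \int_0^1 (1 - s) f''(W_i + s\xi_i)\,ds$,

$\sigma_i^2 f'(W) = \sigma_i^2 f'(W_i) + \xi_i \sigma_i^2 \int_0^1 f''(W_i + s\xi_i)\,ds$,

and hence, using the independence of $\xi_i$ and $W_i$ and that $\mathbb{E}\xi_i = 0$,

(2.14) $|\mathbb{E}\{\xi_i f(W) - \sigma_i^2 f'(W)\}| \le \Big|\mathbb{E}\Big\{\xi_i^3 \int_0^1 (1 - s)\,\mathbb{E}[f''(W_i + s\xi_i) \mid \xi_i]\,ds - \xi_i \sigma_i^2 \int_0^1 \mathbb{E}[f''(W_i + s\xi_i) \mid \xi_i]\,ds\Big\}\Big|$.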

Note now that for any real valued random variable $U$ taking values on a lattice
with span 1, we obtain together with (2.13)

(2.15) $|\mathbb{E} f''(U + z)| \le (\|\Delta g\| D^1(\mathscr{L}(U))) \wedge (\|g\| D^2(\mathscr{L}(U)))$,

for all $z \in \mathbb{R}$. Thus, from (2.14) and (2.15),

(2.16) $|\mathbb{E}\{\xi_i f(W) - \sigma_i^2 f'(W)\}| \le \big(\|\Delta g\| D^1(\mathscr{L}(W_i))(\sigma_i^3 + \tfrac12 \mathbb{E}|\xi_i|^3)\big) \wedge \big(\|g\| D^2(\mathscr{L}(W_i))(\sigma_i^3 + \tfrac12 \mathbb{E}|\xi_i|^3)\big)$.

Now, using (2.12) we have

$|\mathbb{E}\{(\mathcal{A}g)(W)\}| \le \sum_{i \in J} |\mathbb{E}\{\xi_i f(W) - \sigma_i^2 f'(W)\}|$

and with (2.16) the lemma is proved. □

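Proof of Theorem 2.1. Recall that, by Assumptions G, the distributions
$\mathscr{L}(W)$ and $\widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)$ are concentrated on the same lattice. Thus, using
(2.7) and the form (2.8) of the Stein operator, and applying the left side of the
minimum in (2.10) to the first part of (2.8) with the bound (2.9) gives

(2.17) $d_{\mathrm{TV}}(\mathscr{L}(W), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le \frac{\sum_{i \in J} c_{1,i}\rho_i + t^2(4\sigma^2 + \delta) + \sigma t + \delta/4}{\sigma^2} + \mathbb{P}[|W| \ge 2\sigma^2 - 1]$.

To bound the middle part of (2.17) note that $0 \le t < (4\sigma^2 + \delta)^{-1}$ and $0 \le \delta < 1$.
Thus, recalling that $\sigma^2 > 1$, we obtain the simple bounds

$t^2(4\sigma^2 + \delta) \le (4\sigma^2 + \delta)^{-1} \le 1/4$, $\sigma t \le \sigma/(4\sigma^2 + \delta) \le 1/4$, $\delta/4 \le 1/4$.

Applying Chebyshev's inequality on the last term of (2.17) we obtain

$\mathbb{P}[|W| \ge 2\sigma^2 - 1] \le \frac{\sigma^2}{(2\sigma^2 - 1)^2} \le \frac{1}{\sigma^2}$.

The $d_{\mathrm{loc}}$ case is analogous, using the right side of the minimum in (2.10) instead
and the remark after (2.4). □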



Note that in the next corollary we do not assume that the £j have expectation 
zero. 

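Corollary 2.3. Let $W$ be the sum of independent and integer valued random
variables $\{\xi_i, i \in J\}$ with $\sigma_i^2 = \operatorname{Var}\xi_i$ and

$v_i = \min\{1/2,\ 1 - d_{\mathrm{TV}}(\mathscr{L}(\xi_i), \mathscr{L}(\xi_i + 1))\}$.

Then, if $\sigma^2 > 1$,

$d_{\mathrm{TV}}(\mathscr{L}(W), \mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2) * \delta_s) \le \frac{2\sum_{i \in J}(\sigma_i^3 + \frac12 \mathbb{E}|\xi_i|^3)}{\sigma^2 (V - v^*)^{1/2}} + \frac{1 + 2.25\sigma^{-1} + 0.25\sigma^{-2}}{\sigma}$,

$d_{\mathrm{loc}}(\mathscr{L}(W), \mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2) * \delta_s) \le \frac{8\sum_{i \in J}(\sigma_i^3 + \frac12 \mathbb{E}|\xi_i|^3)}{\sigma^2 (V - 4v^*)_+} + \frac{3.25 + 0.25\sigma^{-1}}{\sigma^2}$,

where $s := \lceil \mu - \lceil 4\sigma^2 \rceil / 2 \rceil$, $\mu = \mathbb{E}W$, $V = \sum_{i \in J} v_i$ and $v^* = \max_{i \in J} v_i$.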
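
Proof. Define $W_0 = W - \mu$, and let $t$ be defined with respect to $W_0$, taking
$a = -\mu$. Then, as the metrics $d_l$ are shift invariant,

$d_l(\mathscr{L}(W), \mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2) * \delta_s) = d_l(\mathscr{L}(W_0), \mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2) * \delta_s * \delta_{-\mu})$
$\le d_l(\mathscr{L}(W_0), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) + d_l(\mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2 - t), \mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2))$
$=: R_1^l + R_2^l$,

since $\mathrm{Bi}(\lceil 4\sigma^2 \rceil, 1/2 - t) * \delta_s * \delta_{-\mu} = \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)$.

Applying Lemma 5.2 to $R_2^l$ with the fact that $0 \le t \le (4\sigma^2 + \delta)^{-1}$ gives

(2.18) $R_2^1 \le \sigma^{-1}(1 + (2\sigma)^{-1} + (4\sigma^2)^{-1})$, $R_2^2 \le \sigma^{-2}(1.5 + (4\sigma)^{-1})$.

Define now $c_l = \max_{i \in J} D^l(\mathscr{L}(W - \xi_i))$. Application of (5.9) and (5.10) yields

$c_1 \le \frac{2}{(V - v^*)^{1/2}}$, $c_2 \le \frac{8}{(V - 4v^*)_+}$.

Thus, application of Theorem 2.1 to $R_1^l$ proves the corollary. □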

3. Locally dependent random variables. In this section we present the
main results of the paper. We exploit a finite local dependence structure as presented
in Chen and Shao. In the context of Stein's method for normal approximation,
it has been successfully applied to a variety of problems; see for example
Barbour et al., Dembo and Rinott, and Barbour and Xia. Note that
Barbour et al. use a slightly more general dependence structure, often yielding
crucial improvements when approximating sums of dissociated random variables by
the normal distribution. The generalization of Theorem 3.1 is straightforward, yet
somewhat tedious, and we therefore use the simpler dependence structure of Chen
and Shao; see the Appendix for the more general version, but without proof.

Let $\{\xi_i, i \in J\}$ be a collection of random variables satisfying Assumptions G. For
convenience, let $\xi_A$ denote $\{\xi_i; i \in A\}$ for every subset $A \subset J$. Assume further the
following dependence structure: For every $i \in J$ there are subsets $A_i \subset B_i \subset J$ such
that $\xi_i$ is independent of $\xi_{A_i^c}$, and $\xi_{A_i}$ is independent of $\xi_{B_i^c}$. Define $\eta_i = \sum_{j \in A_i} \xi_j$
and $\tau_i = \sum_{j \in B_i} \xi_j$.

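Theorem 3.1. With $W$ as above,

(3.1) $d_l(\mathscr{L}(W), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le \sigma^{-2}\Big(\sum_{i \in J} \vartheta_{l,i} + 1.75\Big)$, $l = 1,2$,

where

(3.2) $\vartheta_{l,i} = \tfrac12 \mathbb{E}\{|\xi_i| \eta_i^2 D^l(\mathscr{L}(W \mid \xi_i, \eta_i))\} + \mathbb{E}\{|\xi_i \eta_i (\tau_i - \eta_i)| D^l(\mathscr{L}(W \mid \xi_i, \eta_i, \tau_i))\} + |\mathbb{E}\xi_i \eta_i|\,\mathbb{E}\{|\tau_i| D^l(\mathscr{L}(W \mid \tau_i))\}$.

If further there are constants $c_{l,i}$ such that almost surely

(3.3) $D^l(\mathscr{L}(W \mid \xi_{B_i})) \le c_{l,i}$,

then

(3.4) $\vartheta_{l,i} \le c_{l,i}\big(\tfrac12 \mathbb{E}|\xi_i \eta_i^2| + \mathbb{E}|\xi_i \eta_i (\tau_i - \eta_i)| + |\mathbb{E}\xi_i \eta_i|\,\mathbb{E}|\tau_i|\big)$.

Proof. Estimate (3.4) is immediate. Following the proof of Theorem 2.1 and
using Lemma 3.2 below, (3.1) is proved. □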

Note that Theorem 2.1 follows from Theorem 3.1 with the choices $A_i = B_i = \{i\}$.

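Lemma 3.2. Assume the conditions of Theorem 3.1. Define $\mathcal{A} : F(\mathbb{Z} + a) \to
F(\mathbb{Z} + a)$ as in Lemma 2.2. Then,

(3.5) $|\mathbb{E}(\mathcal{A}g)(W)| \le \Big(\|\Delta g\| \sum_{i \in J} \vartheta_{1,i}\Big) \wedge \Big(\|g\| \sum_{i \in J} \vartheta_{2,i}\Big)$.

Proof. We follow the proof of Lemma 2.2 right up to the end of the paragraph
of (2.13). Note now that

(3.6) $\sigma^2 = \sum_{i \in J} \mathbb{E}\{\xi_i \eta_i\}$

and that, by Taylor expansion, almost surely

(3.7)
$\xi_i f(W) = \xi_i f(W - \eta_i) + \xi_i \eta_i f'(W - \eta_i) + \xi_i \eta_i^2 \int_0^1 (1 - s) f''(W - \eta_i + s\eta_i)\,ds$,

$\xi_i \eta_i f'(W - \eta_i) = \xi_i \eta_i f'(W - \tau_i) + \xi_i \eta_i (\tau_i - \eta_i) \int_0^1 f''(W - \tau_i + s(\tau_i - \eta_i))\,ds$,

$\mathbb{E}\{\xi_i \eta_i\} f'(W) = \mathbb{E}\{\xi_i \eta_i\} f'(W - \tau_i) + \mathbb{E}\{\xi_i \eta_i\}\,\tau_i \int_0^1 f''(W - \tau_i + s\tau_i)\,ds$.

Now, using the facts that $\mathbb{E}\xi_i = 0$, that $\xi_i$ is independent of $W - \eta_i$ and that $\eta_i$ is
independent of $W - \tau_i$, we obtain from (3.6) and (3.7) that

$\mathbb{E}\{W f(W) - \sigma^2 f'(W)\} = \sum_{i \in J} \mathbb{E}\{\xi_i f(W) - \mathbb{E}\{\xi_i \eta_i\} f'(W)\}$
$= \sum_{i \in J} \mathbb{E}\Big\{\xi_i \eta_i^2 \int_0^1 (1 - s)\,\mathbb{E}\{f''(W - \eta_i + s\eta_i) \mid \xi_i, \eta_i\}\,ds$
$\quad + \xi_i \eta_i (\tau_i - \eta_i) \int_0^1 \mathbb{E}\{f''(W - \tau_i + s(\tau_i - \eta_i)) \mid \xi_i, \eta_i, \tau_i\}\,ds$
$\quad - \mathbb{E}\{\xi_i \eta_i\}\,\tau_i \int_0^1 \mathbb{E}\{f''(W - \tau_i + s\tau_i) \mid \tau_i\}\,ds\Big\}$.

With (2.12) and (2.15), the lemma follows. □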

We now give a point process version of Theorem 3.1, exploiting mainly the same
dependency structure as before.

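Theorem 3.3. Let $\Phi$ be a simple point process on a Polish space $J$ with mean
measure $\mu$. For all points $\alpha \in J$, assume that there are measurable subsets $A_\alpha \subset
B_\alpha \subset J$, such that for every $\alpha \in J$

(3.8) $\mathscr{L}(\Phi_\alpha(A_\alpha^c)) = \mathscr{L}(\Phi(A_\alpha^c))$,

(3.9) $\Phi_\alpha(A_\alpha)$ and $\Phi_\alpha(B_\alpha^c)$ are independent,

(3.10) $\Phi(A_\alpha)$ and $\Phi(B_\alpha^c)$ are independent,

where $\Phi_\alpha$ denotes the Palm process at point $\alpha$. Then, for $W = \Phi(J) - \mu(J)$ and if
$\sigma^2 > 1$,

(3.11) $d_l(\mathscr{L}(W), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le \sigma^{-2} \int_{\alpha \in J} \vartheta_l(\alpha)\,\mu(d\alpha) + 1.75\sigma^{-2}$, $l = 1,2$,

where, with $\Phi' := \Phi - \mu$ and $\Phi'_\alpha := \Phi_\alpha - \mu$,

(3.12) $\vartheta_l(\alpha) = |\mathbb{E}\Phi'_\alpha(A_\alpha)|\,\mathbb{E}\{|\Phi'(B_\alpha)| D^l(\Phi(B_\alpha^c) \mid \Phi(B_\alpha))\}$
$\quad + \tfrac12 \mathbb{E}\{\Phi'_\alpha(A_\alpha)^2 D^l(\Phi_\alpha(A_\alpha^c) \mid \Phi_\alpha(A_\alpha))\}$
$\quad + \tfrac12 \mathbb{E}\{\Phi'(A_\alpha)^2 D^l(\Phi(A_\alpha^c) \mid \Phi(A_\alpha))\}$
$\quad + \mathbb{E}\{|\Phi'_\alpha(A_\alpha)\Phi'_\alpha(B_\alpha \setminus A_\alpha)| D^l(\Phi_\alpha(B_\alpha^c) \mid \Phi_\alpha(A_\alpha), \Phi_\alpha(B_\alpha))\}$
$\quad + \mathbb{E}\{|\Phi'(A_\alpha)\Phi'(B_\alpha \setminus A_\alpha)| D^l(\Phi(B_\alpha^c) \mid \Phi(A_\alpha), \Phi(B_\alpha))\}$.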

Proof. Following the proof of Theorem 2.1 and Lemma 2.2 it is clear that we
only have to bound $\mathbb{E}\{W f(W) - \sigma^2 f'(W)\}$ for $f$ defined as in (2.11). In what
follows, all integrals are taken over $\{\alpha \in J\}$ if not otherwise stated. Note first that,
because of (3.8),

a 2 = E{$(J)$'(J)} = | M (da)E{<(A Q ) + <(^)} = J (,(da)W a (A a ) 
and hence with Taylor expansion 

a 2 E/W = | »(da)m' a (A a )Mf'(&(B c a )) 

+ J fx(da)m' a (A a )M^'(B a ) J f"(&(B c a )+t&(B a ))dt 
= : R\ + i?2- 

Now, again by Taylor,

nWf(W)}= [ M (da)[E/(^(J))-E/($'(J))] 

Mdo)[E/(*U^))-E/(*'(^))] 
\(da)\lS{^Mf'^' a (A%))}-lS{^(A a )f'^'(Al))} 



(i(da) 



- E 

i?3 + i?4 + -R5 



{<(A Q ) 2 ^(1 - t)/"(*' (A°) + t& a (A a )) dt} 
{<t>'(A a ) 2 f\l - t)f"($'(A c a ) + W{A a )) dt) 



and 



R 4 = / n(da)[lZ{K(A«)f'(& a (B c a ))} -E{$'(A Q )/'($'(^))}] 



li(da) 



{& a (A a )$' a (B a \ A a ) J f"{& a {B c a ) + t$' a (B a \ A a )) dt} 
-M{&(A a )&(B a \A a ) J f"(^(B c a )+tM>'(B a \A a ))dt} 



=: Rq + Rj. 

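Using (3.8)-(3.10), we see that $R_3 = 0$ and $R_1 = R_6$, hence

$|\mathbb{E}\{W f(W) - \sigma^2 f'(W)\}| \le |R_2| + |R_5| + |R_7|$.

With (2.15) we finally obtain

$|R_2| \le \|\Delta g\| \int \mu(d\alpha)\,|\mathbb{E}\Phi'_\alpha(A_\alpha)|\,\mathbb{E}\{|\Phi'(B_\alpha)| D^1[\mathscr{L}(\Phi(B_\alpha^c) \mid \Phi(B_\alpha))]\}$,

$|R_5| \le \tfrac12 \|\Delta g\| \int \mu(d\alpha)\,\big[\mathbb{E}\{\Phi'_\alpha(A_\alpha)^2 D^1[\mathscr{L}(\Phi_\alpha(A_\alpha^c) \mid \Phi_\alpha(A_\alpha))]\} + \mathbb{E}\{\Phi'(A_\alpha)^2 D^1[\mathscr{L}(\Phi(A_\alpha^c) \mid \Phi(A_\alpha))]\}\big]$,

$|R_7| \le \|\Delta g\| \int \mu(d\alpha)\,\big[\mathbb{E}\{|\Phi'_\alpha(A_\alpha)\Phi'_\alpha(B_\alpha \setminus A_\alpha)| D^1[\mathscr{L}(\Phi_\alpha(B_\alpha^c) \mid \Phi_\alpha(A_\alpha), \Phi_\alpha(B_\alpha))]\} + \mathbb{E}\{|\Phi'(A_\alpha)\Phi'(B_\alpha \setminus A_\alpha)| D^1[\mathscr{L}(\Phi(B_\alpha^c) \mid \Phi(A_\alpha), \Phi(B_\alpha))]\}\big]$.

To obtain $\vartheta_2$, just replace $\|\Delta g\|$ by $\|g\|$ and $D^1$ by $D^2$ in the above bounds. □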

Corollary 3.4. Let $\Phi$ be a simple point process satisfying (3.8)-(3.10). If there
is further a function $c_l(\alpha)$, such that for $\mu$-almost every $\alpha \in J$ almost surely

(3.13) $D^l[\mathscr{L}(\Phi(J) \mid \Phi|_{B_\alpha})],\ D^l[\mathscr{L}(\Phi_\alpha(J) \mid \Phi_\alpha|_{B_\alpha})] \le c_l(\alpha)$, $l = 1,2$,






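then (3.12) satisfies

(3.14) $\vartheta_l(\alpha) \le c_l(\alpha)\big[|\mathbb{E}\Phi'_\alpha(A_\alpha)|\,\mathbb{E}|\Phi'(B_\alpha)| + \tfrac12 \mathbb{E}\Phi'_\alpha(A_\alpha)^2 + \tfrac12 \mathbb{E}\Phi'(A_\alpha)^2$
$\quad + \mathbb{E}|\Phi'_\alpha(A_\alpha)\Phi'_\alpha(B_\alpha \setminus A_\alpha)| + \mathbb{E}|\Phi'(A_\alpha)\Phi'(B_\alpha \setminus A_\alpha)|\big]$
$\le c_l(\alpha)\big[1.5\,\mathbb{E}\{\Phi_\alpha(A_\alpha)\Phi_\alpha(B_\alpha)\} + 1.5\,\mathbb{E}\{\Phi(A_\alpha)\Phi(B_\alpha)\}$
$\quad + 6\mu(A_\alpha)\mu(B_\alpha) + 4\mu(B_\alpha)\mathbb{E}\Phi_\alpha(B_\alpha)\big]$.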



4. Applications. In what follows, we calculate only rough bounds, leaving 
much scope for improvement. In particular, we replace the moments in the estimates 
by almost sure bounds. 



4.1. Exceedances of the r-scans process. We follow the notation of Dembo and
Karlin. Let $X_1, X_2, \dots, X_{n+r-1}$ be independent and identically distributed
random variables with distribution function $F$. Define the r-scan process $R_i =
\sum_{k=0}^{r-1} X_{i+k}$, $i = 1, 2, \dots, n$, and further $W_i^- = I[R_i \le a]$ for $a \in \mathbb{R}$. We are interested
in the number $N^- = \sum_{i=1}^n W_i^-$, that is the number of $R_i$ not exceeding $a$.
With $p = \mathbb{E}W_i^- = \mathbb{P}[R_1 \le a]$, we have $\mathbb{E}N^- = np$ and

(4.1) $\sigma^2 = \operatorname{Var} N^- = np\Big(1 - p + 2\sum_{d=1}^{r-1} (1 - d/n)\psi(d)\Big)$,

where $\psi(d) = \mathbb{P}[R_{d+1} \le a \mid R_1 \le a] - p \ge 0$.
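
To make the objects concrete, the following short sketch computes the r-scans $R_i$, the indicators $W_i^-$ and the count $N^-$ for a given sequence. The function name and the toy input are assumptions made for this illustration.

```python
def r_scan_count(xs, r, a):
    """Return (scans, indicators, count) for the r-scan process of xs.

    xs holds X_1, ..., X_{n+r-1}; scans[i] is R_{i+1} = X_{i+1} + ... + X_{i+r}.
    """
    n = len(xs) - r + 1
    scans = [sum(xs[i:i + r]) for i in range(n)]
    indicators = [1 if s <= a else 0 for s in scans]   # W_i^- = I[R_i <= a]
    return scans, indicators, sum(indicators)


scans, indicators, count = r_scan_count([1, 2, 3, 4, 5], r=2, a=5)
# scans == [3, 5, 7, 9], indicators == [1, 1, 0, 0], count == 2
```

Neighbouring scans share $r - 1$ of their summands, which is the source of the local dependence between the indicators.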

Poisson approximations for the r-scan process have been extensively studied
by Dembo and Karlin. Normal approximation has been considered by Dembo
and Rinott; in particular they show that, for fixed $r$ and $a$, $N^-$ converges
in the Kolmogorov metric to the normal distribution with rate $O(n^{-1/2})$. In the
next theorem we achieve the same rate in total variation, and also a rate for the
corresponding local limit approximation.

Theorem 4.1. Assume that $F$ is continuous, $F(0) = 0$, and $0 < F(x) < F(y)$
for all $0 < x < y$, and let $a > 0$ be fixed. Then, for all $n$ such that $\sigma^2 > 1$,

$d_l(\mathscr{L}(N^- - np), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le C_l n^{-l/2}$, $l = 1,2$,

where the constants $C_1$ and $C_2$ are independent of $n$ and can be extracted from the
proof.

Proof. We apply Theorem 3.1 for $W = \sum_{i=1}^n \xi_i = \sum_{i=1}^n (W_i^- - p)$. We can set

$A_i = \{i - r + 1, \dots, i + r - 1\} \cap \{1, \dots, n\}$,
$B_i = \{i - 2r + 2, \dots, i + 2r - 2\} \cap \{1, \dots, n\}$.

Then, as $|A_i| \le 2r - 1$, $|B_i| \le 4r - 3$ and $|B_i \setminus A_i| \le 2r - 2$, the following rough
bounds are obvious:

$\mathbb{E}|\xi_i \eta_i^2| \le (2r - 1)^2$, $\mathbb{E}|\xi_i \eta_i (\tau_i - \eta_i)| \le (2r - 1)(2r - 2)$,
$|\mathbb{E}\xi_i \eta_i|\,\mathbb{E}|\tau_i| \le (2r - 1)(4r - 3)$,

thus

(4.2) $\vartheta_{l,i} \le c_{l,i}(16r^2 - 20r + 6)$.

Consider now the block By = J^Zi W t ~ , and assume that the values dB\ = 
(Xi, . . . , X r -i) and dB 2 = (^3r-i> ■ ■ ■ , -^4r-2) are given. Define the events 

.4 := {a/r <X r ,..., X 2r _ 2l X 2r+1 , . . . , X 3r _ 2 < o(r + l)/r 2 , < X 2r «C a/(2r 2 )} 
Ao ■= {a/r < X 2r _! < o(r + l)/r 2 }, Ai := {0 < X 2r ^ < a/(2r 2 )}. 

Due to the conditions on F and independence it is clear that pj := P[„4n Aj] > 
for j = 0, 1. Note now that 

2r-l 

= > a on iflA), R r < a oniflii. 

z— r 

Note further that R s < a for all s = r + 1, . . . , 2r — 1 on An (.Ao U _4i). Hence 

2r-l 2-r- 1 

^ VKf = r - 1 on AnA , ^2 W~ = r oniflii. 

It is easy to see now by a coupling argument that

\D 1 (^{B 1 )) O-OoApx) < 1. 

Note that, by sequentially stringing together blocks like $B_1$, we can have $m :=
\lfloor n/(3r - 2) \rfloor$ such blocks, which are independent given all the borders $\partial B_i$. Furthermore,
for every $i$, the $R_j$ in $B_i$ depend on the $X_k$ of at most two such blocks.
Therefore, defining $Z = (\partial B_1, \dots, \partial B_m)$ and using (5.11) and (5.12),

$D^1(\mathscr{L}(W \mid \partial B_i, i = 1, \dots, m)) \le \frac{2}{(\min\{1/2, p_0, p_1\}(m - 2))^{1/2}} =: c_{1,i}$,

$D^2(\mathscr{L}(W \mid \partial B_i, i = 1, \dots, m)) \le \frac{8}{\min\{1/2, p_0, p_1\}(m - 4)_+} =: c_{2,i}$.

Clearly, $c_{l,i} = O(n^{-l/2})$. Hence, putting this, (4.1) and (4.2) into (3.1), the theorem
follows. □

4.2. Matérn hard-core process type I. We approximate the total number of
points of the Matérn hard-core process type I introduced by Matérn. We use
rectangular instead of the usual circular neighborhoods. Let $\Phi$ be the process on
the $d$-dimensional cube $J = [0, 1)^d \subset \mathbb{R}^d$ defined as

$\Phi(B) = \sum_{i=1}^{\tau} I[X_i \in B]\,I[X_j \notin K_r(X_i) \text{ for all } j = 1, \dots, \tau,\ j \ne i]$,

where $\tau \sim \mathrm{Po}(\lambda)$ and $\{X_i; i \in \mathbb{N}\}$ is a sequence of independent and uniformly
distributed random variables on $J$ and where, for $x = (x_1, \dots, x_d) \in J$ and $r > 0$,
$K_r(x)$ denotes the $d$-dimensional closed cube with center $x$ and side length $r$. To
avoid edge effects, we treat $J$ as a $d$-dimensional torus, thus identifying any point
outside $J$ with the point in $J$ that results from coordinate-wise shifting by 1. The process
$\Phi$ is thus a thinned Poisson point process with rate $\lambda$, having all points deleted
which contain another point in their $K_r$ neighborhood. For the mean measure $\mu$ of
$\Phi$ we obtain

(4.3) $\frac{d\mu(x)}{dx} = \lambda e^{-\lambda r^d}$.

We are now interested in the distribution of $\Phi(J)$ when $r$ is small and $\lambda$ large.

Theorem 4.2. Put $W := \Phi(J) - \mu(J)$ and let $a > 0$ be a fixed real number.
Then, for every $\lambda$ and $r$ such that $\lambda r^d = a$ and $\sigma^2 := \operatorname{Var} W > 1$,

$d_l(\mathscr{L}(\Phi(J) - \mu(J)), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le C_l \lambda^{-l/2}$, $l = 1,2$,

for constants $C_1$ and $C_2$ which are independent of $\lambda$ and can be extracted from the
proof.

Proof. We apply Corollary 3.4. We can take $A_x = K_{2r}(x)$ and $B_x = K_{4r}(x)$
and check that the conditions (3.8)-(3.10) are fulfilled. Some calculations show that
the reduced second factorial moment measure $M$ satisfies



dM(x) 
dx 



if x € J\ ? .(0), 

X 2 e -\\K„(0)UK„(x)\ if x g # 2r ( ) \ ifr ( ) j 

A 2 e~ 2a ifx<£K 2r (0), 



compare with Daley and Vere-Jones [pp. 367, 373]. Thus, $M(J) \ge \lambda^2 e^{-2a}(1 - r^d)$
and

(4.4) $\sigma^2 = \lambda e^{-a} + M(J) - \mu(J)^2 \ge \lambda e^{-a}(1 - a e^{-a})$.

Since we can have at most $7^d$ points of $\Phi$ in $B_x$, we obtain from (3.14) the rough
estimate



(4.5) 



#t(x) < 26-7 d Q (x), 

Fig 1. Matérn hard-core process type I: Given that the process $\Phi$ is known on the borders
$\bigcup_{l \in \mathcal{M}} K_{6r}(x_l) \setminus K_{5r}(x_l)$ (grey area), the boxes $\Phi|_{K_{5r}(x_l)}$, $l \in \mathcal{M}$, are independent.



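where $c_l(\cdot)$ is as in (3.13). To estimate $c_l(x)$ write $K_r = K_r(0)$. We have

$\mathbb{P}[\Phi(K_r) = 0 \mid \Phi|_{K_{7r} \setminus K_{5r}}] \ge \mathrm{Po}(\lambda|K_r|)\{0\} = e^{-a} =: p_0$,
$\mathbb{P}[\Phi(K_r) = 1 \mid \Phi|_{K_{7r} \setminus K_{5r}}] \ge \mathrm{Po}(\lambda|K_{3r} \setminus K_r|)\{0\} \cdot \mathrm{Po}(\lambda|K_r|)\{1\} = a e^{-3^d a} =: p_1$.

Hence, by a coupling argument,

(4.6) $\tfrac12 D^1(\mathscr{L}(\Phi(K_{6r}) \mid \Phi|_{K_{7r} \setminus K_{5r}})) \le 1 - (p_0 \wedge p_1) < 1$.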



Let now x be arbitrary. Divide the space J into boxes of side length 6r, centered 
around x (see Figure^. With m := [1 /(6r)J , we can have m d such boxes plus a 
remainder. Denote this remainder by J R and denote by xi, I £ {1, . . . , m} =: .M 
the centers of the boxes where a?i i = x. Note now that, given $ on all the borders 
K&r [xi ) \ K§ r (xi ) , Z £ Ai (grey area in Figure^]), the random variables ^(K^ r (xi)), 
I £ A4, are independent and satisfy inequality l|4.6|l . Furthermore, &\j\K 6r (x) i s 
independe nt of $ | B and therefore , defining Z = (($\K 6r .(xi)\K 5r (.x,))l£M>$\j*) 
and using l|5.11|l and (|5.12(l . we obtain 



(4.7) D^JZ^iJ) | *|b.)) < 



(4.8) 



D 2 (j§f($(J) | *| fl J) < 



min{l/2,poiPi}( TO<i 



i{l/2,p ,p 1 }( ? n d -3)- 



) 1/2 
=: c 2 (x) 



ci(x), 



Noting that almost surely $\mathscr{L}[\Phi_x(J \setminus K_{6r}(x)) \mid \Phi_x|_{B_x}] = \mathscr{L}[\Phi(J \setminus K_{6r}(x))]$, we see
that (4.7) and (4.8) hold also for $\Phi_x$, thus $c_l(x)$ satisfies (3.13). Now, recalling that
$a = \lambda r^d$ is constant, we have $c_l(x) = O(\lambda^{-l/2})$. Hence, putting this and (4.3)-(4.5)
into (3.11), the theorem follows. □

5. Appendix.

5.1. Properties of the solution to the Stein equation. 

Lemma 5.1. For any indicator function $h(z) = I[z \in A]$, $z \in \mathbb{Z}$, $A \subset \mathbb{Z}$, the
solution $g = g_A$ to the Stein equation (2.3) satisfies

(5.1) $\|g\| \le 1 \wedge (npq)^{-1/2}$.

Proof. We apply the generator method introduced by Barbour. For any
function $f : \{0, \dots, n\} \to \mathbb{R}$, define

(5.2) $(\mathcal{A}f)(z) = (B(-\Delta f))(z) = qz f(z - 1) - (qz + p(n - z)) f(z) + p(n - z) f(z + 1)$,

which is the infinitesimal generator of a pure jump Markov process. A solution $g$
to (2.3) is now given through

$\psi(z) = -\int_0^\infty \mathbb{E}\{h(Y_z(t)) - h(Y)\}\,dt$, for $z \in \{0, \dots, n\}$,

and $g(z) = -\Delta\psi(z)$ for $z \in \{0, \dots, n - 1\}$ and $g(z) = 0$ else, where $Y_z$ is a Markov
process with generator $\mathcal{A}$ starting at point $z$, and $Y$ is a random variable having
the stationary distribution $\mathrm{Bi}(n, p)$. Now, we have for $z \in \{0, \dots, n - 1\}$,

(5.3) $\Delta\psi(z) = \int_0^\infty \mathbb{E}\{h(Y_z(t)) - h(Y_{z+1}(t))\}\,dt$.

We now fix $z$ and construct a coupling of $Y_z$ and $Y_{z+1}$ to bound (5.3). To this end, let
$X_k^{(i)}(t)$, $k \in \{1, \dots, n\}$, $i \in \{0, 1\}$, be independent Markov processes with state
space $\{0, 1\}$, starting in point $i$ and having jump rate $p$ if the process is in 0 and $q$
otherwise. It is easy to see by the Kolmogorov differential equations that

(5.4) $X_k^{(1)}(t) \sim \mathrm{Be}(p + qe^{-t})$, $X_k^{(0)}(t) \sim \mathrm{Be}(p - pe^{-t})$,

where $\mathrm{Be}(p)$ denotes the Bernoulli distribution with success probability $p$. Let $\tau$
be the minimum of the first jump times of the two processes $X_{z+1}^{(1)}$ and $X_{z+1}^{(0)}$, and
define a new process

$X(t) = X_{z+1}^{(1)}(t)$ if $\tau > t$, and $X(t) = X_{z+1}^{(0)}(t)$ if $\tau \le t$,

describing the well-known Doeblin coupling. Then, let

(5.5) $Y_z = \sum_{k=1}^{z} X_k^{(1)} + \sum_{k=z+1}^{n} X_k^{(0)}$, $Y_{z+1} = Y_z - X_{z+1}^{(0)} + X(t)$,

and one proves that $Y_z$ and $Y_{z+1}$ are Markov processes with generator (5.2). Hence,
we can write (5.3) as

(5.6) $-\Delta\psi(z) = \int_0^\infty e^{-t}\,\mathbb{E}\{\Delta h(Y_z)\}\,dt$,
Jo 
since $\tau$ is exponentially distributed with rate 1. The bound $\|g\| \le 1$ is now immediate
from (5.6), thus we may assume that $npq > 1$. Note that, from (5.4) and
(5.5),

$\mathscr{L}(Y_z) = \mathrm{Bi}(z, p + qe^{-t}) * \mathrm{Bi}(n - z, p - pe^{-t})$,

and hence, from Barbour and Jensen [Lemma 1],

(5.7) $D^1(\mathscr{L}(Y_z)) \le \operatorname{Var}(Y_z)^{-1/2} = \big(z(p + qe^{-t})(q - qe^{-t}) + (n - z)(p - pe^{-t})(q + pe^{-t})\big)^{-1/2} \le (npq(1 - e^{-t}))^{-1/2}$.

Note also that for $\tilde h := h - 1/2$

(5.8) $|\mathbb{E}\{\Delta h(Y_z)\}| = |\mathbb{E}\{\Delta \tilde h(Y_z)\}| \le D^1(\mathscr{L}(Y_z))/2$.

Thus, applying (5.8) on (5.6) and using (5.7),

$|\Delta\psi| \le \int_0^s e^{-t}\,dt + \frac{1}{2\sqrt{npq}} \int_s^\infty \frac{e^{-t}}{\sqrt{1 - e^{-t}}}\,dt$.

Choosing $s = -\ln(1 - (npq)^{-1})$ and computing the integrals proves the lemma. □
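
The marginal laws in (5.4) can be double-checked numerically. For a two-state chain that jumps from 0 to 1 at rate $p$ and from 1 to 0 at rate $q = 1 - p$, the probability $y(t)$ of being in state 1 solves the Kolmogorov forward equation $y' = p(1 - y) - qy$. The sketch below integrates this equation with a classical Runge-Kutta scheme and compares with the closed forms; the function name, step count and parameter values are assumptions for illustration.

```python
from math import exp


def prob_one(start, p, t, steps=2000):
    """P[X(t) = 1] for the two-state chain, by RK4 on y' = p (1 - y) - q y."""
    q = 1 - p
    rhs = lambda y: p * (1 - y) - q * y
    y, h = float(start), t / steps
    for _ in range(steps):
        k1 = rhs(y)
        k2 = rhs(y + 0.5 * h * k1)
        k3 = rhs(y + 0.5 * h * k2)
        k4 = rhs(y + h * k3)
        y += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return y


p, t = 0.3, 1.7
from_one = prob_one(1, p, t)            # numerical value for the chain started in 1
from_zero = prob_one(0, p, t)           # numerical value for the chain started in 0
closed_one = p + (1 - p) * exp(-t)      # Be(p + q e^{-t}) success probability
closed_zero = p - p * exp(-t)           # Be(p - p e^{-t}) success probability
```

Both chains relax to the stationary success probability $p$ at exponential rate 1, which is the same rate at which the coupling time $\tau$ occurs.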
5.2. Change of the success probabilities. 

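Lemma 5.2. For every $n \in \mathbb{N}$, $0 < p < 1$ and $-(1 - p) < t < p$,

$d_{\mathrm{TV}}(\mathrm{Bi}(n, p - t), \mathrm{Bi}(n, p)) \le |t| \Big\{ \frac{\sqrt{npq}}{pq} + \frac{p - t}{pq} + \frac{\sqrt{(p - t)(q + t)}}{pq\sqrt{n}} \Big\}$,

$d_{\mathrm{loc}}(\mathrm{Bi}(n, p - t), \mathrm{Bi}(n, p)) \le |t| \Big\{ \frac{1 + p - t}{pq} + \frac{\sqrt{(p - t)(q + t)}}{pq\sqrt{n}} \Big\}$.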

Proof. We use Stein's method. If $W \sim \mathrm{Bi}(n, p - t)$, we obtain from (2.1)
and (2.2)

$\mathbb{E}\{(1 - p) W g(W - 1) - p(n - W) g(W)\} = \mathbb{E}\{t W \Delta g(W - 1) - t n g(W)\}$

for every bounded function $g \in F(\mathbb{Z})$. The left side is just the Stein operator for
$\mathrm{Bi}(n, p)$; hence, taking $g = g_A$ obtained by solving (2.3) for $\mathrm{Bi}(n, p)$, with the bounds
(2.4) and (5.1) the $d_{\mathrm{TV}}$-bound follows, noting also that $\mathbb{E}|W| \le |\mathbb{E}W| + \sqrt{\operatorname{Var} W}$.
With the remark after (2.4), the $d_{\mathrm{loc}}$-bound is proved. □
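
As a plausibility check, the two distances in Lemma 5.2 can be computed exactly for moderate $n$ and compared with the bounds. The sketch assumes that the bounds read $|t|\{\sqrt{npq}/(pq) + (p - t)/(pq) + \sqrt{(p - t)(q + t)}/(pq\sqrt{n})\}$ for $d_{\mathrm{TV}}$ and $|t|\{(1 + p - t)/(pq) + \sqrt{(p - t)(q + t)}/(pq\sqrt{n})\}$ for $d_{\mathrm{loc}}$; the function names and parameter values are illustrative assumptions.

```python
from math import comb, sqrt


def binom_pmf(n, p, k):
    """Probability that a Bi(n, p) random variable equals k, for 0 <= k <= n."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def distances(n, p, t):
    """Exact d_TV and d_loc between Bi(n, p - t) and Bi(n, p)."""
    diffs = [abs(binom_pmf(n, p - t, k) - binom_pmf(n, p, k)) for k in range(n + 1)]
    # On the integer lattice every interval [x, x + 1) contains exactly one
    # lattice point, so d_loc is the largest pointwise difference.
    return 0.5 * sum(diffs), max(diffs)


def lemma_5_2_bounds(n, p, t):
    """The two bounds of Lemma 5.2, in the reading stated above."""
    q = 1 - p
    root = sqrt((p - t) * (q + t)) / (p * q * sqrt(n))
    tv_bound = abs(t) * (sqrt(n * p * q) / (p * q) + (p - t) / (p * q) + root)
    loc_bound = abs(t) * ((1 + p - t) / (p * q) + root)
    return tv_bound, loc_bound


n, p, t = 35, 0.5, 1 / 175
tv, loc = distances(n, p, t)
tv_bound, loc_bound = lemma_5_2_bounds(n, p, t)
```

With $p = 1/2$ and $t$ of order $1/n$, the total variation bound is of order $n^{-1/2}$ and the local bound of order $n^{-1}$, in line with (2.18).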

5.3. Smoothing properties of independent random variables. In several parts of
this paper, we need to estimate $D^m(U)$, $m = 1,2$, for
some integer valued random variable $U$ that is a sum of other random variables.
If $U$ is a sum of independent random variables, we can proceed as follows.
Assume that $U = \sum_{i=1}^n X_i$, where the $X_i$ are independent. Defining $v_i = \min\{\frac12, 1 -
\frac12 D^1(X_i)\}$ and $V = \sum_i v_i$, we obtain from Barbour and Xia [Proposition 4.6] the
bound

(5.9) $D^1(U) \le \frac{2}{V^{1/2}}$.

Define further $v^* = \max_i v_i$. Now it is always possible to write $U = U^{(1)} + U^{(2)}$
in such a way that the analogously defined numbers $V^{(1)}$ and $V^{(2)}$ satisfy $V^{(k)} \ge
V/2 - v^*$, $k = 1, 2$. Using (1.2) and (5.9), we obtain

(5.10) $D^2(U) \le D^1(U^{(1)}) D^1(U^{(2)}) \le \frac{4}{(V^{(1)} V^{(2)})^{1/2}} \le \frac{8}{(V - 2v^*)_+}$.

5.4. Smoothing properties of conditionally independent random variables. In most
applications, $U$ is a sum of dependent summands and we cannot apply (5.9) and
(5.10) directly. However, assuming that there is a random variable $Z$ on the same
probability space as $U$ such that $\mathscr{L}(U \mid Z = z)$ can be represented as a sum of
independent summands, say $X_i^{(z)}$, $i = 1, \dots, n_z$, for each $z$ that $Z$ can attain, we
can still apply (5.9) and (5.10), and we obtain

(5.11) $D^1(U) \le \mathbb{E}\{D^1(\mathscr{L}(U \mid Z))\} \le \mathbb{E}\Big\{\frac{2}{V_Z^{1/2}}\Big\}$,

(5.12) $D^2(U) \le \mathbb{E}\{D^2(\mathscr{L}(U \mid Z))\} \le \mathbb{E}\Big\{\frac{8}{(V_Z - 2v_Z^*)_+}\Big\}$,

where, for each $z$, $V_z$ and $v_z^*$ are the corresponding values as defined in subsection
5.3 with respect to the $X_i^{(z)}$.

6. Appendix. We now give a generalization of Theorem 3.1. The proof is
omitted, because it runs analogously to the proof of Theorem 3.1; see also Barbour
et al.

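Suppose that a random variable $W$ satisfies Assumptions G and assume that
there are sets $K_i \subset J$, $i \in I$, and square integrable random variables $Z_i$, $Z_{ik}$ and
$V_{ik}$, $k \in K_i$ and $i \in I$, as follows:

(6.1) $W = W_i + Z_i$, $i \in I$, where $W_i$ is independent of $\xi_i$,

(6.2) $Z_i = \sum_{k \in K_i} Z_{ik}$,

(6.3) $W_i = W_{ik} + V_{ik}$, $i \in I$, $k \in K_i$,

where $W_{ik}$ is independent of the pair $(\xi_i, Z_{ik})$.

Theorem 6.1. With $W$ as above,

(6.4) $d_l(\mathscr{L}(W), \widehat{\mathrm{Bi}}(\lceil 4\sigma^2 \rceil, 1/2 - t)) \le \sigma^{-2}\Big(\sum_{i \in I} \vartheta_{l,i} + 1.75\Big)$, $l = 1,2$,

where

$\vartheta_{l,i} = \tfrac12 \mathbb{E}\{|\xi_i| Z_i^2 D^l(\mathscr{L}(W_i \mid \xi_i, Z_i))\}$
$\quad + \sum_{k \in K_i} \mathbb{E}\{|\xi_i Z_{ik} V_{ik}| D^l(\mathscr{L}(W_{ik} \mid \xi_i, Z_{ik}, V_{ik}))\}$
$\quad + \sum_{k \in K_i} |\mathbb{E}\{\xi_i Z_{ik}\}|\,\mathbb{E}\{|Z_i + V_{ik}| D^l(\mathscr{L}(W_{ik} \mid Z_i, V_{ik}))\}$.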

Acknowledgments. I thank A. D. Barbour, D. Schuhmacher and B. Nietlispach
for many helpful discussions.